Abstract

Internet-based education media are developing at a remarkable rate and are seen as a newcomer that is likely to dominate
traditional means of education worldwide in the future. This paper presents and discusses the empirical results of a
comparative analysis of user preferences for four major types of internet-based education media used in China. We first
sort the internet-based education media currently prevalent in China into four basic types and summarize their
characteristics. We then group the users with a two-step clustering analysis, on the basis of which we determine the
preference of each user cluster and identify the major influencing factors.

Keywords: comparative analysis, internet-based education media, two-step clustering 

1. Introduction 

Wide public access to education through effective information technologies is one of the successes of today’s society
(Colwell, 2001; Starke-Meyerring and Andrews, 2006). As the world enters the big data era, internet-based means of
education have become increasingly significant. Many types of internet-based education media have been developed, and
their number is growing rapidly. Internet-based media hold the promise of increased public access to high-quality
education (Bates, 2003; Moore and Anderson, 2003). Apart from world-renowned online education platforms such as MOOCs
(Massive Open Online Courses) and long-established online education communities, many other internet-based education
media have been developed in China and are widely used by people of various backgrounds.

The evolution of the modern world economy calls urgently for a well-educated workforce (Dykman and Davis, 2008). For
example, the three I’s (India, Indonesia, and Ireland) and, more recently, China are finding that the demand for an
educated workforce is overwhelming the capacities of their traditional educational systems (Brown, Murphy, and Wade, 2006;
Chen, 2007; Jacob and Szirmai, 2007; Tilak, 2007; Dykman and Davis, 2008). Given the growing importance of internet-based
education, especially in developing countries, the current study conducts a comparative analysis of user preferences for
different internet-based education media in China and identifies the factors that influence them. The results may be used
by developers of those media to improve their products and services, or by designers looking for niches in which to
develop new forms of online education.

The rest of this paper is organized as follows. Section 2 reviews the literature. Section 3 briefly introduces the
methodology applied in the comparative analysis. Section 4, the main body of the study, describes our research procedure,
presents our findings, and discusses the important issues. Section 5 presents the final results of our comparative
analysis. Finally, Section 6 concludes by summarizing and highlighting our research findings.

2. Literature Review 

While very little literature directly related to online education media is available, preference analyses that apply
similar methods to other subjects have informed this paper, especially in terms of methodology. The principles of
preference analysis and of clustering themselves play a crucial role in guiding the analysis. Wendell Smith (1956) put
forward the theory of market segmentation, which holds that customers can be sorted into groups according to their
cultural identification, level of income, and consumption habits. Only by targeting different customer groups and
providing them with customized products and services can firms with limited resources compete efficiently. Therefore,
when analyzing preferences in this paper, we do not simply lump together the feedback from all users. Instead, we conduct
a two-step clustering analysis that groups the users into different types. Clustering analysis is a statistical technique
well suited to data with multiple variables. Several types of clustering method are used across research fields. The most
frequently used is hierarchical clustering, which is a connectivity model based on the distances between observations.
Next comes the K-means algorithm, together with variants such as the K-prototypes and K-modes algorithms. These
algorithms represent each cluster by a single prototype vector (the mean in K-means, the mode in K-modes). Others include
density models (DBSCAN and OPTICS), group models, subspace models, and graph-based models. The two-step clustering
algorithm is a tool used primarily for mixed data that include both continuous and discrete variables. It belongs to the
hierarchical clustering family.

With respect to customer clustering analysis, preference analysis, and studies of online education media per se (not
necessarily involving the two-step clustering method), several prior studies have informed our work. Lu (2012) applies
K-means clustering to customer relationship management in the telecommunications industry and offers several suggestions
for maintaining firms’ competence in optimizing customer relationships. Liu, Wang, Liu, and Zhong (2005) use a
clustering-based method to mine text preferences: the experimental results indicate that this method represents user text
preferences more precisely and is not sensitive to the relevance threshold. Regarding online education media, we mainly
review literature on the features of particular means of online education, since few preference analyses have been
conducted in this field. Wang, Zhang, and Zhang (2013) take seven MOOC programs abroad as samples, characterize MOOCs in
terms of organizing mechanics, positioning, course organization, resources, teaching methods, and quality accreditation,
and, based on their findings, put forward suggestions for developing the Chinese MOOC market. Chen and Wang (2013)
compare the features of MOOCs and traditional means of education, and argue that the key features of MOOCs can be
summarized as large-scale, highly open, internationalized, and innovative.

The literature mentioned above focuses on one specific field at a time: the application of a clustering method, a
preference analysis, or the study of one type of online education. To our knowledge, no analysis has yet linked these
fields together to present more comprehensive and accurate empirical results on the preferences of online education
platform users. This paper aims to fill this gap and expand the current research scope by providing a thorough
comparative analysis of four types of internet-based education media currently in wide use in China.

3. Methodology 

We start by designing a questionnaire, which is then distributed to respondents in order to obtain a random sample of
current or potential users of online education platforms. The questionnaire presents the characteristics of four types of
online education media currently in wide use in China. Each respondent (treated as a current or potential user of online
education) is asked to provide a subjective assessment of each type by answering the questions and scoring the media.
Based on the respondents’ feedback, we sort the users with a clustering analysis and then determine the preferences of
each user cluster and the factors that influence them. The research framework is shown in Figure 1.



Figure 1. The Research Framework 

4. The Research Procedure and Related Discussions

4.1 Step 1: Defining the Variables 

We collect data on users’ characteristics through their answers to the questions designed. We define thirteen variables to 
describe their characteristics, which are summarized in Table 1. 


Table 1. Variables and Their Meanings

Variable Name | Description
age | User’s age
occupation | Ranges from 1 to 8 to describe a job field
z-educ | Z-Score of user’s educational degree: under College = 1; Bachelor = 2; Graduate or above = 3; Z-Score = (Value - 1)/(3 - 1)
major | Ranges from 1 to 9, for 9 different major categories
chmajor | User has a plan to change major = 1; Otherwise = 0
frapply | User has a plan to apply for education abroad = 1; Otherwise = 0
creditvl | User cares about whether the online courses offer a certificate of accomplishment = 1; Otherwise = 0
flex | User can manage their own time to study at least half an hour every day = 1; Otherwise = 0
follow | User has a habit of following the restricted timeline of courses = 1; User prefers choosing his/her own time of studying = 0
ovteach | User prefers teacher teaching = 1; User has no evident inclination towards teacher teaching = 0
ovsubset | User values systematic department setting over other factors in choosing platforms = 1; User’s attitude is neutral = 0
ovspec | User has a strong interest in a certain area and has developed a habit of using certain related platforms = 1; Otherwise = 0
ovencyclo | User prefers gathering comprehensive knowledge to learning profoundly in a specific area = 1; Otherwise = 0
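
The education score defined in Table 1 can be illustrated with a short sketch. Note that, although the variable is labelled a Z-Score, the formula (Value - 1)/(3 - 1) is a rescaling of the three degree codes onto the interval from 0 to 1, so a bachelor’s degree maps to 0.5. The function and dictionary names below are hypothetical and are not taken from the questionnaire.

```python
# Hypothetical helper illustrating the education score defined in Table 1.
# The degree codes (1, 2, 3) come from Table 1; the names are illustrative.
EDUCATION_CODES = {
    "under college": 1,
    "bachelor": 2,
    "graduate or above": 3,
}


def education_score(level: str) -> float:
    """Rescale the 1-3 degree code to the 0-1 range: (value - 1) / (3 - 1)."""
    value = EDUCATION_CODES[level]
    return (value - 1) / (3 - 1)


for level in EDUCATION_CODES:
    print(level, education_score(level))
```

Under this coding, a cluster mean above 0.5 indicates that the cluster contains more respondents with graduate degrees than respondents without a college degree.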


The descriptive statistics of the collected data, which contain 128 effective sample points, show some interesting
results, which are presented in Tables 2 and 3.

Table 2. (Question 6.) Do you have a plan to change your major? [Single-choice]

Options | Number | Percentage
A. Yes | 41 | 32.03%
B. No | 87 | 67.97%
Number of effective sample points | 128 | 100%


Table 3. (Question 17.) Do you have a preference for a certain field and have you already developed a habit of using a
specific platform related to this field? (Such as IT, Software Engineering, Business, Foreign Language, etc.)
[Single-choice]

Options | Number | Percentage
A. Yes | 34 | 26.56%
B. No | 94 | 73.44%
Number of effective sample points | 128 | 100%


As these results indicate, respondents who plan to change their majors, and respondents who prefer platforms devoted to a
specific area, are both in the minority. We therefore remove four variables that would likely lower the quality of our
cluster analysis by raising more problems than they solve: occupation (which may differ considerably from the user’s
original major); major (which may be less influential than expected, since respondents with no apparent inclination
towards a specific area make up the majority of the sample); chmajor (which could be distracting, given that only a few
respondents want to change their majors); and ovspec (which Table 3 suggests matters little in choosing platforms). The
remaining nine variables form our clustering base.

4.2 Step 2: Two-step Clustering of the Users 

As the variables include both continuous and discrete ones, frequently used methods such as K-means clustering may not be
appropriate for the dataset. We therefore adopt a two-step clustering method, which is widely regarded as suitable for
mixed variables. The users (respondents) are divided into four groups, whose proportions are depicted in Figures 2 and 3.
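
To make the difficulty with mixed variables concrete, the following minimal sketch clusters a handful of made-up respondents described by two continuous and three binary variables. It does not reproduce the two-step algorithm used in this paper; it substitutes a simpler technique, a Gower-style dissimilarity combined with average-linkage agglomerative clustering, purely as an illustration. The sample rows, column choices, and function names are all assumptions.

```python
from itertools import combinations

# Made-up respondents: (age, z-educ, frapply, creditvl, flex). Illustrative only.
ROWS = [
    (21, 0.5, 1, 1, 1),
    (22, 0.5, 1, 1, 1),
    (29, 1.0, 0, 0, 0),
    (30, 1.0, 0, 0, 0),
]
CONTINUOUS = (0, 1)  # column indices treated as continuous
BINARY = (2, 3, 4)   # column indices treated as 0/1 categories


def gower_distance(a, b, ranges):
    """Average per-variable dissimilarity for mixed continuous/binary data."""
    parts = []
    for i in CONTINUOUS:
        # Range-scaled absolute difference, so every variable lies in [0, 1].
        parts.append(abs(a[i] - b[i]) / ranges[i] if ranges[i] else 0.0)
    for i in BINARY:
        # Simple matching: 0 if the categories agree, 1 otherwise.
        parts.append(0.0 if a[i] == b[i] else 1.0)
    return sum(parts) / len(parts)


def agglomerate(rows, n_clusters):
    """Average-linkage agglomerative clustering; returns lists of row indices."""
    ranges = {i: max(r[i] for r in rows) - min(r[i] for r in rows)
              for i in CONTINUOUS}
    dist = {(i, j): gower_distance(rows[i], rows[j], ranges)
            for i, j in combinations(range(len(rows)), 2)}
    clusters = [[i] for i in range(len(rows))]

    def linkage(c1, c2):
        pairs = [(min(i, j), max(i, j)) for i in c1 for j in c2]
        return sum(dist[p] for p in pairs) / len(pairs)

    while len(clusters) > n_clusters:
        # Merge the two clusters with the smallest average dissimilarity.
        x, y = min(combinations(range(len(clusters)), 2),
                   key=lambda p: linkage(clusters[p[0]], clusters[p[1]]))
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return [sorted(c) for c in clusters]


print(agglomerate(ROWS, 2))
```

Scaling each continuous variable by its range keeps age from swamping the binary answers, which is the practical problem that plain K-means on raw mixed data would run into.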

The defining characteristics of the four groups are listed below, ordered by their influencing power.

Characteristics (ordered from the most to the least important) of Cluster 3 (the largest cluster, accounting for 32.8% of
the sample):

A. Prefer acquiring comprehensive knowledge to learning profoundly in a specific area. 

B. Value systematic department setting over other factors in choosing platforms. 

C. Able to manage personal time to study at least half an hour daily. 

D. Have the habit of following restricted timelines of courses. 

E. Plan to apply for education abroad. 

F. Prefer to be taught by teachers. 

G. Value the certificate of accomplishment. 

H. Have an education level below the sample average of 0.63 (0.5 corresponds to a bachelor’s degree).

I. Have an average age of 22.38 (the youngest group).

Figure 2. Relative Sizes of the Four Clusters in Percentages (smallest cluster: 16 respondents, 12.5%; largest cluster:
42 respondents, 32.8%; ratio of largest to smallest: 2.62)
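
As a consistency check on the cluster sizes, the reported percentages can be converted back into respondent counts. The sketch below assumes that each percentage is a rounded share of the 128 effective sample points; the counts for Clusters 1 and 2 are therefore inferred rather than reported, and the variable names are illustrative.

```python
# Cluster shares as reported in the text (percent of 128 respondents).
TOTAL_RESPONDENTS = 128
CLUSTER_SHARES = {3: 32.8, 1: 31.2, 2: 23.4, 4: 12.5}

# Inferred head counts: the nearest whole number of respondents for each share.
cluster_sizes = {
    cluster: round(share * TOTAL_RESPONDENTS / 100)
    for cluster, share in CLUSTER_SHARES.items()
}

largest = max(cluster_sizes.values())
smallest = min(cluster_sizes.values())
size_ratio = largest / smallest  # reported as 2.62 in Figure 2

print(cluster_sizes, sum(cluster_sizes.values()), size_ratio)
```

The inferred counts add up to the full sample, and the largest and smallest counts match the 42 and 16 respondents given with Figure 2.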

Characteristics (ordered from the most to the least important) of Cluster 1 (the second largest cluster, accounting for
31.2% of the sample):

A. Prefer acquiring comprehensive knowledge to learning profoundly in a specific area. 

B. Neutral attitude towards systematic department setting. 

C. Cannot manage personal time to assure half an hour’s study daily. 

D. Have a habit of following the restricted timelines of courses. 

E. Have no plans for applying for education abroad. 

F. Prefer to be taught by teachers. 

G. Value the certificate of accomplishment. 

H. Have an education level close to the sample average of 0.63 (0.5 corresponds to a bachelor’s degree).

I. Have an average age of 23.32 (the second oldest group).

Characteristics (ordered from the most to the least important) of Cluster 2 (the third largest cluster, accounting for
23.4% of the sample):

A. Prefer acquiring comprehensive knowledge to learning profoundly in a specific area. 

B. Value systematic department setting over other factors in choosing platforms. 
C. Able to manage personal time to study at least half an hour daily.

D. Prefer choosing own personal time for studying. 

E. Have no plans for applying for education abroad. 

F. Prefer to be taught by teachers. 

G. Value the certificate of accomplishment. 

H. Have an education level above the sample average of 0.63 (0.5 corresponds to a bachelor’s degree).

I. Have an average age of 24.30 (the oldest group).

Figure 3. The Result of Clustering (cluster profiles; cell shading indicates the importance of each input variable, with
a legend running from 1.0 to 0.0)

Characteristics (ordered from the most to the least important) of Cluster 4 (the smallest cluster, accounting for 12.5%
of the sample):

A. Have no evident preference between learning comprehensive knowledge and focusing on one specific area.

B. Neutral attitude towards systematic department setting. 

C. Able to manage personal time to study at least half an hour daily. 

D. Prefer choosing own personal time for studying. 

E. Plan to apply for education abroad. 

F. Have no evident inclination towards being taught by teachers.

G. Do not value the certificate of accomplishment over other factors.

H. Have an education level close to the sample average of 0.63 (0.5 corresponds to a bachelor’s degree).

I. Have an average age of 23.12 (the second youngest group).

4.3 Step 3: Analyzing the Preferences of the Clusters 

Four types of internet-based education platforms are chosen, and their characteristics are summarized in Table 4.

Table 4. A Detailed Description of the Four Types of Internet-Based Education Media

Types | Open online courses | Multi-major mutual support / Resource sharing community | Single-field forum | Cloud platform
Representative | MOOC | One-man University | The IT Home, Excel Home, Put club, etc. | Aedu.cn [1]
Special models | Micro Courses [2] | | Open Source Community [3] |
Courses are mainly taught by teachers | Yes | Basically no | Basically no | To some extent but not a key feature
Source of courses | Mainly open courses [4] | Mainly videos [5] | Various sources [6] | Mainly videos [7]
Fields covered | Broad | Broad | Focuses on a specific field [8] | Specific courses [9]
Clear department setting [10] | No | Yes | No | No
Giving credits | Yes | No | No | No
Restricted timelines of teaching schedule | Yes [11] | No | No | No
Group tasks and final examinations | Yes [12] | Some, but not mandatory | No | Some, but not mandatory
Acknowledged by world-renowned universities | Yes [13] | Currently no | No | No
Popularity worldwide | High | Low | Low | Low
Serve well in assisting application for foreign education | Yes | Depends | Depends | Basically no
Availability of certificates of accomplishment | Yes | No | No | No
Affiliated forum and resource sharing platform | Yes | Yes | Yes | Yes
Freedom of expression | High | High | High | Relatively low
Offline supporting communities | Few | Many | No | No
Contribution of students’ spontaneous uploading and sharing | Less important | Crucial | Crucial | Not important


Users are asked to give each type of online education media a score from 1 to 10, based on a comparison of the
characteristics given above. We obtain the final result by separating the answers by cluster and calculating the mean
score that each cluster gives to each of the four platform types. The type with the highest mean score is identified as
the cluster’s strongest preference. We then compare the characteristics of each cluster’s first choice (presented above)
with the reasons the respondents give for that choice, which are collected with the final question in the questionnaire
(Question 21, Table 5).


Notes to Table 4:

[1] It aims at forming a platform to support efficient communication between school, parents and students. It is
basically an extended version of traditional schools based on the Internet.

[2] Their duration ranges from five minutes to half an hour, and they are distributed mainly through mobile devices such
as cell phones.

[3] It concentrates on sharing resources and exchanging views on developing and using open source software. It is highly
personalized.

[4] These are mainly provided by world-renowned universities.

[5] They are recorded by the initiator and by talents in specific fields (most of whom are top university students abroad).

[6] The courses provided can be quite flexible, depending on the resources uploaded by members.

[7] They are recorded by teachers of elementary and high schools, aiming to assist students in taking the standardized
national examinations.

[8] Examples of these fields include IT, Business, and Foreign Languages.

[9] The courses mainly aim at assisting students in preparing for standardized examinations.

[10] Complete sets of recommended reference books are also available.

[11] Assignments submitted after the deadline do not count toward the certificate.

[12] Failing the final exam means that no certificate is granted.

[13] More and more universities are starting to acknowledge MOOC credits.

Table 5. (Question 21.) The reason for choosing X (which could be any one of the four types) as your top choice is that:
[Multiple-choice]

Options | Number | Percentage
(1) Widely acknowledged by foreign universities and helpful in applying for education abroad | 55 | 42.97%
(2) Offers a certificate of accomplishment, which may serve well in achieving specific goals (such as for an application/changing majors) | 51 | 39.84%
(3) Has strict rules in setting schedules and deadlines for submitting homework and is taught by teachers | 50 | 39.06%
(4) Has systematic department setting for each major and a whole set of textbooks recommended | 47 | 36.72%
(5) Comprehensive enough in covering knowledge of multi-fields which can help expand views and accumulate knowledge | 68 | 53.13%
(6) Minimal limitations on study time, which suits me who has difficulties managing personal time | 30 | 23.44%
(7) I am used to learning on mobile devices and taking advantage of fragmented time: the micro courses offered by MOOC really suit me | 37 | 28.91%
(8) I have a preference for certain fields so that I am inclined to use certain forums related to those fields | 20 | 15.63%
(9) I have developed a habit of using a certain platform, which is hard to change | 7 | 5.47%
Number of effective sample points | 128 |
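
Because Question 21 is a multiple-choice question, each percentage in Table 5 is the share of the 128 respondents who selected that option, so the percentages need not add up to 100%. The sketch below recomputes these shares from the option counts. It assumes half-up rounding to two decimal places, which is what reproduces the reported figures such as 53.13% and 15.63%; the function name and the option numbering from 1 to 9 are illustrative.

```python
from decimal import Decimal, ROUND_HALF_UP

RESPONDENTS = 128
# Number of respondents selecting each option of Question 21, in table order.
OPTION_COUNTS = {1: 55, 2: 51, 3: 50, 4: 47, 5: 68, 6: 30, 7: 37, 8: 20, 9: 7}


def percent_of_respondents(count: int, total: int = RESPONDENTS) -> Decimal:
    """Share of respondents choosing an option, rounded half-up to 2 places."""
    share = Decimal(100 * count) / Decimal(total)
    return share.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


for option, count in OPTION_COUNTS.items():
    print(option, count, f"{percent_of_respondents(count)}%")

# More selections than respondents: each respondent could tick several options.
print(sum(OPTION_COUNTS.values()))
```

Exact decimal arithmetic is used here because binary floating-point rounding in Python rounds exact ties such as 53.125 to the even neighbour, which would give 53.12 rather than 53.13.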


5. The Final Results and Related Discussions 

The final results of our comparative analysis are presented below. 


Table 6. The Mean Scores of the Platform Types Given by Different Clusters

Cluster | Open online courses | Multi-major mutual support / Resource sharing community | Single-field forum | Cloud platform
1 | 8.125 | 6.625 | 6.75 | 6.125
2 | 8 | 9 | 7.5 | 5
3 | 7.810 | 7.667 | 7.5 | 5.5
4 | 6 | 6.25 | 7.5 | 5.5
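
The rule for identifying each cluster’s strongest preference (the platform type with the highest mean score) can be sketched as follows, using the mean scores from Table 6. The sketch also reports the margin over the second-ranked type, which shows how decisive each preference is. The dictionary and function names are illustrative.

```python
# Mean scores copied from Table 6; names below are illustrative.
PLATFORM_TYPES = (
    "Open online courses",
    "Multi-major mutual support / Resource sharing community",
    "Single-field forum",
    "Cloud platform",
)
MEAN_SCORES = {
    1: (8.125, 6.625, 6.75, 6.125),
    2: (8, 9, 7.5, 5),
    3: (7.810, 7.667, 7.5, 5.5),
    4: (6, 6.25, 7.5, 5.5),
}


def strongest_preference(cluster):
    """Return the top-scored platform type and its margin over the runner-up."""
    ranked = sorted(zip(MEAN_SCORES[cluster], PLATFORM_TYPES), reverse=True)
    (best_score, best_type), (second_score, _) = ranked[0], ranked[1]
    return best_type, best_score - second_score


for cluster in sorted(MEAN_SCORES):
    platform, margin = strongest_preference(cluster)
    print(f"Cluster {cluster}: {platform} (margin {margin:.3f})")
```

The margins differ considerably: Cluster 1 separates its first choice from the rest by more than a full point, whereas Cluster 3 ranks its top two types within a fraction of a point of each other.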


A. The mean scores of the platform types given by users of Cluster 1: 

As Table 6 shows, users of Cluster 1 clearly prefer open online courses, represented by MOOC (a mean score of 8.125,
against at most 6.75 for the other types). According to their answers to Question 21, which give the reasons for this
choice, the primary influencing factors are the following (ordered by the number of times chosen):

a. Comprehensive enough in covering knowledge of multi-fields which can help expand views and accumulate knowledge; 

b. Offers a certificate of accomplishment, which may serve well in achieving specific goals; 

c. Has strict rules in setting schedules and deadlines for submitting homework and is taught by teachers; 

d. Used to learning on mobile devices and taking advantage of fragmented time, and value the micro courses offered
by MOOC.

We see that a preference for comprehensive knowledge is the most prominent feature of Cluster 1. Users in Cluster 1 also
value certificates, but for reasons other than applying for foreign education, because the cluster consists mainly of
people who have no plans to study abroad. They tend to favor the MOOC type because of its self-discipline mechanisms and
its assurance of teaching by teachers. It should also be noted that they have developed a habit of studying in fragmented
time and tend to like micro courses, which are a special model of the MOOC type.

B. The mean scores of the platform types given by users of Cluster 2:

Table 6 shows that users of Cluster 2 give their highest score to multi-major mutual support/resource sharing
communities, represented by the “one-man university”. The main reasons are listed below:

a. Comprehensive enough in covering knowledge of multi-fields which can help expand views and accumulate knowledge; 

b. Has systematic department setting for each major and a whole set of textbooks recommended; 

c. Minimal limitations on study time to allow for a flexible schedule for studying. 

Cluster 2 has a clear preference for the one-man university type. Combined with the cluster’s characteristics, we may
conclude that this is because its users value systematic department setting more than any other factor in choosing a
platform, and only the one-man university type offers it. They have no plans to apply for foreign education, which means
that certificates and the fame of the platform are not necessary. Although they personally value certificates and
teaching by teachers (both of which rank as relatively less important characteristics in the order given above), their
preference for the one-man university type over MOOC is plausible. They are also inclined to manage their study time
flexibly, whereas MOOC imposes restricted timelines. Finally, these users are inclined to learn comprehensive knowledge,
which may partly explain why they value MOOC more than the single-field forum.

C. The mean scores of the platform types given by users of Cluster 3: 

As shown in Table 6, users of Cluster 3 rate the MOOC type above the other platform types, but its mean score (7.810) is
only about 0.14 points higher than that of the one-man university type (7.667). The reasons they give are listed as
follows:

a. Comprehensive enough in covering knowledge of multi-fields which can help expand views and accumulate knowledge; 

b. Widely acknowledged by foreign universities and helpful in applying for education abroad; 

c. Offers a certificate of accomplishment, which may serve well in achieving specific goals; 

d. Has strict rules in setting schedules and deadlines for submitting homework and is taught by teachers; 

e. Has systematic department setting for each major and a whole set of textbooks recommended. 

We may conclude from the descriptive information on Cluster 3 that these users have specific needs, such as applying for
education abroad. They are attracted by comprehensive knowledge, which is not uncommon among potential international
students, who often have broad interests and strong curiosity. Although these users value systematic department setting
over other factors, their needs (for example, going abroad) may well outweigh their personal interests. That is, they
choose the MOOC type because it is more widely acknowledged by foreign universities and offers a certificate, both of
which will presumably play a crucial role in supporting their applications. These users also value the self-discipline
mechanism, the cross-supervision, and the assured education quality of the MOOC type, all of which tend to serve their
applications well. Their first choice is therefore not surprising.

D. The mean scores of the platform types given by users of Cluster 4: 

The figures indicate that users of Cluster 4 seem more inclined to using the single-field forum rather than any other 
media. The reasons given by them turn out to be the following: 

a. Minimal limitations on study time to allow for a flexible schedule for studying; 

b. Preference for a certain field so that the user is inclined to use certain forums related to that field; 

c. A habit of using a certain platform, which is hard to change. 

Combined with the characteristics of Cluster 4 described earlier, we find that these users have no apparent inclination
towards comprehensive knowledge, teaching by teachers, or certificates of accomplishment. They also have a neutral
attitude towards systematic department setting. In fact, the mean scores given by this cluster tend to be lower than
those given by the other clusters, which may indicate a more carefree and indifferent attitude towards the features of
the media. They appear to value the freedom to manage their own time more than the features of any media type: as the
description of Cluster 4 above shows, they prefer to choose their own study time and are able to manage it. The reasons
they give match our analysis and further show that they have specific preferences for certain fields. They may have
developed a habit of using platforms devoted to particular areas, which is hard to change. These users, who have certain
personal interests and have no need of applying for foreign education, are more likely to be IT programmers or
engineering enthusiasts. This conjecture is supported by the occupation information given by Cluster 4.

6. Conclusion

In this paper, we apply a two-step clustering analysis to sort our respondents (treated as current and potential users of
internet-based education media) into four clusters, and on that basis determine each cluster’s preferences among four
types of internet-based education platforms currently in wide use in China. Our main conclusions have implications for
strategic planning, both in developing new forms of internet-based education and in improving the existing ones.

Through our comparative analysis, we conclude that certificates and acknowledgement are key factors in the choices of
people who have special needs, such as going abroad or changing their majors, while department setting and teaching by
teachers can be of great value to people who emphasize education quality. These factors should always be studied together
with people’s habits in managing their study time, e.g., whether they have flexible time and whether they are willing to
follow rules on timelines.

A minor issue that we would like to mention before closing is the low preference shown by all the clusters for the last
type of media, the cloud platform. Our conjecture is that this platform serves only as an online extension of traditional
schools, at least judging from the way it is currently managed. This implies that it has little involvement with the
spirit of the internet, such as freedom, students’ initiative, and resource sharing without boundaries. Moreover, the
cloud platform is more likely to be dominated by teachers and often fails to give students adequate involvement. Such a
platform has a long way to go in bringing in the real spirit of the internet before it can be regarded as an effective
means of internet-based education.